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O ■ 1 Introduction 

<N ' 

Variable neighborhood search (VNS) 0[5] is a metaheuristic for solving optimization problems based on 
a simple principle: systematic changes of neighborhoods within the search, both in the descent to local 
minima and in the escape from the valleys which contain them. VNS applications have been numerous 
and successful. Many extensions have been made, mainly to be able to solve large problem instances. 
Nevertheless, a main driver behind VNS is to keep the simplicity of the basic scheme. Variable neigh- 
borhood descend (VND) belongs to the family of methods within VNS. The idea behind VND is to sys- 
^ \ tematically switch between different neighborhoods where a local search is comprehensively performed 

^ 1 within one neighborhood until no further improvements are possible. Given the found local optimum, 

VND continues with a local search within the next neighborhood. If an improved solution is found one 
may resort to the first neighborhood; otherwise the search continues with the next neighborhood, etc. 

Designing these neighborhoods and applying them in a meaningful fashion is not an easy task. More- 
over, incorporating these neighborhoods, e.g., into VNS may still need a considerable degree of ingenuity 
and may be confronted with the following research questions (among others): 

• Which neighborhoods can be designed? 

• How should they be explored and how efficient are they? 

• How structurally different are they? 

1 

• In which order should they be applied? 
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In this paper we attempt to investigate especially the latter concern. Assume that we are given an 
optimization problem that is intended to be solved by applying the VNS scheme, how many and which 
types of neighborhoods should be investigated and what could be appropriate selection criteria to apply 
these neighborhoods. More specifically, does it pay to "look ahead" (see, e.g., @ in the context of 
VNS and GRASP) when attempting to switch from one neighborhood to another? Is it reasonable to 
apply "nested neighborhood structures" (like first 2-optimal exchanges, then 3-optimal exchanges, then 
4-optimal etc.) or should neighborhood structures be considerably different from each other? Often 
different neighborhoods are proposed allowing for ideas regarding sequential and nested changes. 

We cannot provide a comprehensive survey on the VNS in this extended abstract. We emphasize just 
very few earlier works on VNS investigating some of the questions raised. 

For instance, in 12 VNS is hybridized with a genetic algorithm (GA). Chromosomes in the GA 
represent the possible choices of neighborhoods to be applied. However, as the authors state, "the order- 
ing of neighborhoods is unimportant since VNS cycles through all neighborhoods." On the other hand, 
when hybridizing the VNS with the pilot method as it is performed in 01 it actually seems favorable 
to put some effort in an appropriate choice of the next neighborhood. Problems considered are exam 
timetabling in the first case and telecommunications network design in the latter. These results are in line 
with the observations in Q for applying VND on a different network design problem. 
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To summarize, different sources come to different conclusions making it even more important to 
investigate these topics further. While our research questions are of generic nature we attempt to in- 
vestigate them and provide answers with respect to the single machine total weighted tardiness problem 
(SMTWTP) as a specific combinatorial optimization problem. Though this setting might not necessarily 
lead to generalizable results (as is the case with the references cited above) they seem to allow additional 
insights into the VNS method in itself. 

2 VNS for the Single Machine Total Weighted Tardiness Problem 

The SMTWTP or 1 1 | Y^WjTj is a well-known machine scheduling problem. In the SMTWTP, a set 
of n jobs with weights indicating their relative importance needs to be processed on a single machine 
while minimizing the total weighted tardiness. For a recent paper on the SMTWTP plus some related 
references see, e.g., O. 

With respect to our above mentioned research questions, the SMTWTP may be seen as a represen- 
tative problem with the following characteristics: easy to represent by some straightforward encoding, 
hard to solve, an existing testbed of problem instances with known optimal solutions is readily available 
for detailed analysis. 

Neighborhoods for the SMTWTP. The set of neighborhoods to be explored should be important 
enough to get some insights regarding our purpose. We have decided to build a set of neighborhoods 
composed by the classical ones and by some structurally different neighborhoods. 

The investigated neighborhoods include the following: 

APEX it is the most classical neighborhood for permutation encodings. Two adjacent jobs are ex- 
changed in the sequence. For a solution, the size of this neighborhood is n — 1. 

BR4 consists in taking a block of four consecutive jobs and inverse its internal orientation. The size of 
this neighborhood is n — 3. 

BR5 is identical to BR4 but with a block of five jobs. The size of this neighborhood is n — 4. 

BR6 is identical to BR4 but with a block of six jobs. The size of this neighborhood is n — 5. 

EX\APEX is the general pairwise interchange where the APEX is excluded. Two non-adjacent jobs are 
exchanged. The size of this neighborhood is n(n — 3) + 2. 

FSH\APEX takes a job and moves it to a position further in the sequence, resulting in a forward shift 
for the jobs in between these two positions. APEX is excluded from this move. The size of this 
neighborhood is Ya=i i = (n - 2)(n - 1) /2. 

BSH\APEX works as FSH\APEXbut the jobs are shifted backward. The size is identical to FSH\APEX. 

Experimental Design. Since we want to observe the behavior of the search when selecting the different 

neighborhoods, we propose three different approaches. To simplify the experiments and not be too 

dependent of randomness, instead of using a VNS, we first use a VND scheme. 

In the experiments, we will compare the three following algorithms: 
VND-R When a local optimum is reached, the next neighborhood is selected randomly among possible 

alternatives. The search terminates when all the neighborhoods cannot produce a better solution 

than the incumbent. 

VND-F This is the classical version of VND. In our experiments, we choose the order in which neigh- 
borhoods are introduced above. 

VND-A This is an adaptive version of VND. Every time a new neighborhood has to be selected, an 
investigation of all neighborhoods is run for a short number of iterations (say, 100) and the neigh- 
borhood producing the best solution at this time is selected for the descent method. This neighbor- 
hood is used until no more improvements are obtained. The investigation phase is run again, etc. 
The stopping condition is the same as before. 

To make a fair comparison, for all approaches we count the number of evaluations made during the 
searches and we shall use graphs indicating the quality of the best solution vs. the number of evaluations. 
To consider nested neighborhoods, the above exclusion may be abolished, too. 
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3 Preliminary results 

Experiments are conducted on the classical 100 job instances from Q. These instances have some 
advantages: best known solutions are stable (not improved for a long time), the number of instances is 
not too large (125) and these instances still seem difficult to solve (despite their relatively small size with 
100 jobs). In Figure[T]we provide as an example the results of the first run on the first of the investigated 
set of problem instances. In this case we find that VND-R takes longer to stop than VND-F, and that 
takes more time than VND-A. That is, we may deduce a positive aspect of the adaptive version of VND. 
We can also see that if we run VND for less than 750 evaluations, VND-R gives best results, and less 
than 834 evaluations give priority to VND-F. Of course, 834 evaluations are not enough to reach good 
quality and VND-A shows its strength here. 



This ideal situation was unfortunately not observed as a general behavior and we report considerably 
more details in the final version of the paper together with related conclusions and proposals for related 
adaptivity in VNS and VND with the final conclusion that adaptivity actually pays. 
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Figure 1 : Example run for the first instance of j2l 
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